Add retry decorator to tests that are vulnerable to transient service issues#421

Merged
timmarkhuff merged 7 commits into main from tim/add-retry-decorator on Apr 8, 2026
Conversation

@timmarkhuff (Contributor) commented Apr 7, 2026

Some tests in python-sdk are vulnerable to bad responses from the cloud service. For example, an image query might get a result of STILL_PROCESSING, which means the cloud didn't have an answer in time. This is a transient error and will almost always be resolved with a retry.

This PR adds a retry decorator to protect such tests. Any test that submits an image query and asserts anything about the result is now protected with this decorator.

I also found a few instances of tests that were not using our standard detector_name function for naming detectors. I fixed those too.
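The decorator itself isn't shown in this excerpt, so here is a minimal sketch of the pattern the PR describes: re-running a test a few times before letting a transient failure surface. The name `retry`, the parameters, and the example test are assumptions for illustration, not the actual python-sdk implementation.

```python
import functools
import time


def retry(max_attempts=3, delay_seconds=2, exceptions=(AssertionError,)):
    """Re-run a flaky test up to max_attempts times before failing.

    Hypothetical sketch of the pattern described in this PR; the real
    decorator in python-sdk may differ in name and behavior.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    if attempt == max_attempts:
                        raise  # out of retries: let the failure surface
                    time.sleep(delay_seconds)  # back off before retrying
        return wrapper
    return decorator


# Usage on a test that could transiently see STILL_PROCESSING:
@retry(max_attempts=3, delay_seconds=1)
def test_image_query_gets_answer():
    # submit an image query and assert on the result (omitted here)
    ...
```

With this in place, a single bad cloud response (e.g. a STILL_PROCESSING result) triggers a retry instead of failing the CI run outright.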

@timmarkhuff timmarkhuff requested a review from brandon-wada April 7, 2026 20:38
@timmarkhuff timmarkhuff changed the title Add retry decorator Add retry decorator to test functions that are vulnerable to transient service issues Apr 7, 2026
@timmarkhuff timmarkhuff changed the title Add retry decorator to test functions that are vulnerable to transient service issues Add retry decorator to tests that are vulnerable to transient service issues Apr 7, 2026
@brandon-wada (Collaborator) left a comment


I'm fine with trying this out.

I think the central worry with something like this is that it might hide real flaky issues. However, we're currently fairly comfortable attributing the flakiness we observe to irregular usage patterns when 12 copies of the same tests run in sync with each other via GHA. Given that, we're unlikely to be hiding issues relevant to any real usage, and we should still be able to observe real issues in our BE alerting.

@timmarkhuff (Contributor, Author) replied

> I'm fine with trying this out.
>
> I think the central worry with something like this is that it might hide real flaky issues. However, we're currently fairly comfortable attributing the flakiness we observe to irregular usage patterns when 12 copies of the same tests run in sync with each other via GHA. Given that, we're unlikely to be hiding issues relevant to any real usage, and we should still be able to observe real issues in our BE alerting.

Ideally, none of the SDK tests should be intended to uncover flakiness. Flakiness is better discovered through canary tests than through unit tests in python-sdk.

@timmarkhuff timmarkhuff merged commit 44f1732 into main Apr 8, 2026
15 of 16 checks passed